BabyDoctor is a multimodal large language model that combines the capabilities of CLIP and LLaMA 2. It takes images alongside text as input and generates text in response. The model has been fine-tuned specifically for interpreting radiology images such as X-rays, ultrasounds, MRIs, and CT scans.
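One common way to combine a CLIP-style vision encoder with a LLaMA-style language model is to project the image patch features into the language model's token-embedding space and prepend them to the text embeddings. The sketch below illustrates only that general idea with toy dimensions and random values; whether BabyDoctor uses exactly this design is an assumption, and the names `project` and `build_input_sequence` are hypothetical helpers, not part of any library.

```python
import random

def project(patch_features, weight, bias):
    """Linear map from vision-feature space into the text embedding space.

    `weight` holds one row per output dimension; each row has one entry
    per vision-feature dimension.
    """
    return [
        [sum(f * w for f, w in zip(patch, row)) + b for row, b in zip(weight, bias)]
        for patch in patch_features
    ]

def build_input_sequence(patch_features, text_embeddings, weight, bias):
    """Prepend projected image tokens to the text token embeddings."""
    return project(patch_features, weight, bias) + text_embeddings

# Toy sizes for illustration only (assumed; real models are far larger).
VISION_DIM, TEXT_DIM, NUM_PATCHES, NUM_TOKENS = 4, 6, 3, 5

rng = random.Random(0)
# Stand-ins for the vision encoder output and the text token embeddings.
patches = [[rng.gauss(0, 1) for _ in range(VISION_DIM)] for _ in range(NUM_PATCHES)]
tokens = [[rng.gauss(0, 1) for _ in range(TEXT_DIM)] for _ in range(NUM_TOKENS)]
# Stand-in for a learned projection layer.
weight = [[rng.gauss(0, 0.1) for _ in range(VISION_DIM)] for _ in range(TEXT_DIM)]
bias = [0.0] * TEXT_DIM

sequence = build_input_sequence(patches, tokens, weight, bias)
print(len(sequence), len(sequence[0]))  # 8 6
```

The resulting sequence has one vector per image patch followed by one per text token, all in the same dimension, so a language model could attend over both in a single pass.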
Tags: Image-to-Text, Transformers, English